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Abstract 

We derive a Markov basis consisting of moves of degree at most three for two- 
state toric homogeneous Markov chain model of arbitrary length without parameters 
for initial states. Our basis consists of moves of degree three and degree one, which 
alter the initial frequencies, in addition to moves of degree two and degree one for 
toric homogeneous Markov chain model with parameters for initial states. 
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^ : 1 Introduction 

Q . Consider a Markov chain X^, t = 1,...,T(> 3), over finite state space S. Let u 

O ! (-Si, • • • , St) € S^ denote a path of a Markov chain. In this paper we discuss Markov 

bases of toric ideals arising from the following statistical model 



><| . p{uj) = cPs^S2---l3sT.iST, (1) 

where c is a normalizing constant. In Pachter and Sturmfels |5|, the model ([I]) is called 
a toric Markov chain model. In Takemura and Hara [Si] we considered a model with 
additional parameters 7^, s G iS, for the initial states: 

P{UJ) = C-/sJs,S2 ■ ■ ■ PsT-lST (2) 

and derived a Markov basis of toric ideals arising from ([2]) for the case oi S = {1,2} 
(arbitrary T) and the case of T = 3 (arbitrary S). In Takemura and Hara p] we called the 
model ([2]) a toric homogeneous Markov chain(THMC) model. The model ([T]) corresponds 
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to THMC model with 71 = ■ ■ ■ = 7|5|. For distinguishing two models, we call ([T]) a THMC 
model without initial parameters. 

In the present paper, we generalize the result in Takemura and Hara |8| to the model 
([T]) and derive a Markov basis for the case S = {1,2} and arbitrary T. 

From a statistical viewpoint, Markov bases are used to test goodness-of-fit of the model 
based on the exact distribution of a test statistic. A data set of paths is summarized in 
an |iS|^ contingency table. The set of contingency tables sharing sufficient statistic is 
called a fiber. A Markov basis is defined as a set of moves connecting every fiber. In this 
paper we consider the problem in the framework of contingency table analysis and derive 
a Markov basis as a set of moves which guarantees the connectivity of every fiber. 

The organization of the paper is as follows. In Section 2, we introduce some notations 
and terminologies and give some preliminary results. In Section 3, we state the main 
theorem and give a Markov basis for the model ([1]) with S = {1,2}. We give a proof of 
the theorem in Section 4. A numerical example with a real data set is given in Section 5. 
In Section 6 we end the paper with some concluding remarks. 

2 Preliminaries 

2.1 Notation and terminology 

Let oj = {si, . . . ,st) be a path. For notational simplicity we sometimes write u = 
(si . . . St) or u = Si ... St- If Si = ■ ■ ■ = st = i, we call co a fiat path at state i. If 
for some t (1 < t < T), si = --- = St = ij^j = St+i = ■ ■ ■ = st, then we call u a 
single-step path from i to j at time t. We say that a path u starts at i and ends at j if 
si = i,ST = j- The set of such paths is denoted by Wi^...^j. We call u a non-fiat cycle, if 
it starts from i, visits j ^ i on the way and ends at i. We denote the set of non-fiat cycles 
starting from i by 

Wi^j^i = {u \ si = i,St = j, St = i, for some 1 < t < T}. (3) 

In Figure [H we depict examples of a fiat path, a single-step path and a non-fiat cycle. 

s\t 12 3 4 ^\^ I 
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(i) a fiat path (ii) a single-step path (iii) a non-fiat cycle 

Figure 1: A fiat path, a single-step path and a non-fiat cycle for T = 4 

A data set of n paths is summarized in an |iS|"^ contingency table x = {x{u)),oj G 



S^} of total frequency n, where x{u) denotes the frequency of the path u. Let x* 



«j 



X]a;s =i s =j ^i'^) denote the number of transitions from st = iio St+i = j m x and let 



Ju):st=t,St+\=j 



^i ~ Swst=i''^('^) denote the frequency of the state St = i in x. In particular x\ is the 
frequency of the initial state Si = i. Let 

T-l 
t=l 

denote the total number of transitions from i to j in x. 

The sufficient statistic for the model ([1]) is given by the frequencies of transitions 

b = {x±,t,jeS}. (4) 

The sufficient statistic for the model ([2]) is given by the union of (^ and the set of initial 
frequencies, 

b* = {xlie S} u {x±,ij eS}Db. 

For 5 = {1, 2} we write the elements of b in (jl]) as bu, bi2, ^21, ^22- 

If we order paths appropriately and write a; as a column vector according to the order, 
b in (jl]) is written in a matrix form 

b = Ax, 

where A is an |5p x |iS|"^ matrix consisting of non- negative integers with the rows indexed 
by |iSp transitions and the columns indexed by |iS|^ paths. The {{i,j),u) element A is 
the number of occurrences of the transition from i to j in the path u. 

A is the configuration defining toric ideal I a arising from the model ([I]). A toric 
ideal Ia is the kernel of the homomorphism of polynomial rings tp : k[{p{uj),uj G S^}] — ?■ 
k[{(3ij,i,j G S}] defined by 



tp : p{u) f-> (3s^s2 ■■■(3. 



St— 1ST 1 



where {p{u),uj G S^} is regarded as a set of indeterminates. Algebraically a Markov basis 
is defined as a set of generators of I a (Diaconis and Sturmfels [2|). 

The set of all contingency tables sharing b is called a fiber and denoted by J-f, = {a; G 

Z!>Q I Ax = 6}, where Z>o = {0, 1, . . . }. A move z for the model (^ is an integer array 
satisfying Az = 0. In contingency table analysis a Markov basis is defined as a finite set 
of moves Z satisfying that for all b and all pairs x and y in J-5 there exists a sequence 
zi, . . . ,Zk ^ 2 such that 

K I 

y = x + ^Zk, x + ^Zk>Q, l = l,...,K. (5) 

fc=i fc=i 

In this article we provide a Markov basis as a set of moves satisfying ([5]) for the model 
^ with 5 = {1,2}. 

A move z is expressed by a difference of two contingency tables x and y in the same 
fiber: 

z = X — y, z{u) = x{u) — y{u}), u G S^ . 

We write z\^ = x^j - y\j. 




2.2 Some classes of moves 

In this section we introduce some classes of moves for the model ([2]) discussed in Takemura 
and Hara [8|. Since b G b*, the moves introduced here are also moves for the model ([T]). 
Denote St-.t' = St . . . St' and s'^.^, = s'^. . . . s[, for t < t'. If t > t', let s^t' be an empty 
sequence. Let u = (si, . . . , st) be a path satisfying st^ = St-^ = St^ = i for 1 < to < i^i < 
t2 <T and St = j ^ i for some to < t < t2. Then consider the following swapping 

Then an integer array z = {z{w),w G S^} 

1 if a; = a; 
z{uj) = { -I iiuj = co' (6) 

otherwise 

forms a move for ([2]). We call this move a degree one move. 

We depict a degree one move with T = 4, to = 1, ti = 2 and t2 = 4 as 



(7) 



where a solid line from (z, t) to {j, t + 1) represents 2;*- = 1 and a dotted line from [i, t) to 
{j,t + 1) represents z*^ = — 1. We call a graph like (|7]) a move graph. A node of a move 
graph is a pair (i,t) of state i and time t and an edge from (z,t) to (j, f + 1) represents 
the value of z\y If \z\j\ = 0, there is no corresponding edge in the graph. If \zjj\ > 2, we 
write the value of 1 2*,- 1 beside the edge. 

We say that two paths u = (si, . . . , sy), u' = {s[, . . . , s^) meet (or cross) at the node 
{i, t) Hi = St = s[. If oj and u' cross at the node {i, t), consider the swapping of these two 
paths like 

{U, U'} = { (Sl:t_l, Z, St+l-.r), (s', i, St+1:T) } 

^ { {Si:t^i,i,s[^^.j.), {s',i,St+l:T) } =: {uj,uj'}. 

Then the integer array z 

1 if a; = tD or w' 
2;('a;) = ■^ —1 ii u = u or u' (8) 

otherwise 

forms a move for ([2]). We call this move a crossing path swapping. As shown in Takemura 
and Hara |8I] , a crossing path swapping is expressed by the difference of two tables with 
the same edge-sign pattern and hence the move graph for a crossing path swapping has 
no edge. As shown in Takemura and Hara [8l] , if a^ and y are in the same fiber such that 
the move graph of z has no edge, x and y are connected by crossing path swappings. 



Suppose that to 7^ ^i- Choose (not necessarily distinct) four paths ui,U2, 003,004, as 

^1 = (Sl,l:to-li 1> 1' •*l,to+2:T)? OO2 = (S2,l:to-l ) 2, 2, S2,to+2:T) 5 
1^3 = (*3,l:ti-l) 1) 2, ■S3,ii+2:T)? OO4 = (S4^i.t^_i, 2, 1, S4,ti+2:T); 

where Sk,t:t' = Sfc,*, . . . , Sfc,t'- Then we consider the swapping {uji,uj2, 003, 004} ^ {ooi, 002, 003, 004}, 
where 

'^l = (Sl,l:to-li 1> 2, S2,io+2:T)5 OJ2 = {S2,l:to-lj'^Ai ^l,to+2:T) , 
^3 = (S3,l:ti-1) I7 1) 'S4,ti+2:r)5 '^'4 = (S4,l:ti-1, 2, 2, S3,ti+2:r)- 

Then the integer array z 

{1 if oj = oJi, . . . ,a;4, 
—1 if w = OJi, . . . ,U4, 
otherwise. 

is a move for ([2]) and is called 2 by 2 swap. The corresponding move graph is depicted as 

to ti 

(9) 



For S = {1,2} it can be shown that we can always choose either ui = u^, U2 = 004 or 
Ui = U4, 002 = 003 (Takemura and Hara |8|). Therefore for 5 = {1, 2} a 2 by 2 swap always 
corresponds to a degree two move z. 

Next we consider the following swapping for T > 4: 

{Ui,U2) := {(sto-l,l,l,2,Sio+3:T),(s'i:t,_i,l,2,2,S;^+3^y)} 

O {uJi,(:J2) ■■= {(Sl:to-l,l,2,2,Sto+3:T),(si:t^_l,l,l,2,Sj^_^.3^2^)}. 



Then the integer array z 



1 if OJ = 00i,U2, 

z{u) = ■{ —1 iiu = 001,002, (10) 

otherwise. 



also forms a move for ([2]), which can be depicted as 



to ti to ti 

or 




3 Main result 

In Takeinura and Hara [8] we provided a Markov basis for the model ([2]) as follows. 

Proposition 1. A Markov basis for S = {1,2}, T > A, for ([2]) consists of (i) degree one 
moves ^, (a) crossing path swappings, (Hi) 2 by 2 swaps ^, (iv) moves in ^^). For 
T = 3 a Markov basis consists of the first three types of moves. 

Since 2 by 2 swaps for iS = {1, 2} corresponds to degree two moves, ([2]) has a Markov 
basis consisting of moves of degree at most two. 

We call degree one moves in (^ type I degree one moves. We note that the moves 
introduced in the previous section are moves for ([2]) and hence do not alter the initial 
frequencies. For connecting fibers for the model ([1]) we need moves altering initial fre- 
quencies. First we present a degree one move altering the initial frequency. Consider a 
non-flat cycle u G Wi^^j^^i in (|3]), vising j at time t. Consider the following degree one 
move: 

Wi^j^i 3 u = {si,. . .,St) ^ uj' = {st, . . . , St-1, Si,..., St) G Wj^i^j. (11) 

u and u' has the same number of transitions, but the initial states are different. We call 
this move a type II degree one move. An example of a type II degree one move for T = 4 
is depicted as 

^^ • (12) 




Next we consider a degree three move. Let l<a<b<T— l,a + b<T— 1. Choose 
1 < u <T — 1 such that 

a + iL<T-l, b+{T-l-u) <T -1. 

The range of such u is 

b<u<T-l-a. 

Consider the following set of three paths: 

V^i = {11 . . . 1, 1_^2_^, 1_^2_^}, 

a T-a b T-b 

which consists of a fiat path at z = 1, a single-step path going from 1 to 2 at time a, and 
a single-step path going from 1 to 2 at time b. Choose u G {b, . . . ,T — 1 — a} and let 
a' = a + u, b' = b + T — 1 — u. Consider 

W2 = {22...2, 1...12...2, 1...12...2}. 

a' T-a' b' T~b' 



Then integer array z 



1 iiueWi, 
z{uj) = { -I if w e W2, (13) 

otherwise 



is a move for ([2]). We call this move a degree 3 sliding move. An example of a degree 3 
sliding move with T = 4, a = l, b = 2 and m = 3 is depicted in Figure |2] (i). As a degree 
3 sliding move, we also consider the time reversal of f lT3|) which involves single-step paths 
going from 2 to 1 as depicted in Figure [2] (ii) ■ 



Figure 2: Sliding moves 

Now we state the main theorem of this paper. 

Theorem 1. The set of type II degree one moves of (1111) and the set of degree 3 sliding 
moves in (1131) . in addition to moves in PropositionUl constitutes a Markov basis for ([2]). 

We give a proof of this theorem for T > 4 in the next section. Note that the above 
statement also covers the case T = 3, namely, for T = 3, we do not need moves of type 
(iv) in Proposition [H 

At this point we briefly discuss the case of T = 3. Although we can prove Theorem [T] 
for the case T = 3 along the lines of the next section, it is trivial to use 4ti2 (4ti2 team 
PJ) and verify Theorem [H It is interesting to note that the degree 3 sliding move for 

r = 3 

{111,122,122} ^ {112,112,222} 

is indispensable (Takemura and Aoki [6|, Ohsugi and Hibi |4i])- Hence every Markov basis 
has to contain a degree three move for T = 3. 

4 Proof of the main theorem 

In this section we give a proof of Theorem [T] for T > 4. We first take care of two types of 
special fibers. Then we employ a distance reduction argument as in Takemura and Hara 
[8[] and Takemura and Aoki [7]. 

Consider a fiber with bn = or 622 = 0. We have the following lemma. 

Lemma 1. Fibers with bu = or 622 = are connected by type I degree one moves and 
crossing path swappings. 

Proof. By symmetry consider the case bu = 0. Consider an arbitrary fiber J-f, with bu = 
and arbitrary x G J-f,. It suffices to show that we can transform a; to a unique x* G J-5 
by type I degree one moves and crossing path swappings. This can be accomplished as 
follows. We can easily check that by applying type I degree one moves and crossing path 
swappings, we can transform paths w G Wu...^2 in x to following three types of paths, 

1. (1212 ■ ■ ■ 12) when T is even or (1212 ■ • • 122) when T is odd; 

or 



2. (1212- ■■ 1222- ■■2); 



3. (1222 ■■•2); 



so that the number of type 2 paths is at most one. In the same way we can transform 
paths w G W2*...*2 in ^ to 

1. (2121 ■ • • 2122) when T is even and (2121 ■ • • 212) when T is odd; 

2. (2121 ■■ •2122- ■■2); 

3. (22 ■■■2); 

so that the number of type 2 paths is at most one. The paths w G Wu...^i in x are 
transformed to 

1. (1212 ■ ■ ■ 12) when T is even and (1212 ■ ■ ■ 1221) when T is odd; 

2. (1212 ■■■1222 ■■■21); 

3. (122^^^21); 

so that the number of type 2 paths is at most one. The paths w G W2*-*i in x are 
transformed to 

1. (2121 ■ ■ ■ 21) when T is even and (2121 ■ ■ ■ 21221) when T is odd; 

2. (2121^^^2122^^^21); 

3. (22^^^21); 

so that the number of type 2 paths is at most one. Then the resulting contingency table 
is uniquely defined. D 

Next consider a fiber with 612 = or 621 = 0. 

Lemma 2. Fibers with bu = or 621 = are connected by moves of type (iv) in Propo- 
sitionUl crossing path swappings and degree 3 sliding moves. 

Proof. By symmetry consider the case 621 = 0. In this case every path of a contingency 
table is either a fiat path or a single-step path going from 1 to 2. In particular the 
number of single-step paths is 612. Note that a contingency table of a fiber is determined 
by choosing positions of transitions for the single-step paths from 1 to 2. A single-step 
path going from 1 to 2 at time t contains t — 1 transitions 1 — )■ 1 and T — t — 1 transitions 
2 — )■ 2. Therefore once we choose positions of the transitions, the number of transitions 
1 — )■ 1 and 2 — )■ 2 contained in these single-step paths are determined. The remaining 
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number of transitions 1 — > 1 and 2 — )■ 2 belong to flat paths. Therefore the remaining 
numbers have to be multiples of T — 1. This implies that fiber is partitioned into subsets 
by the number of fiat paths at 1 or at 2. Note that two contingency tables x, y with the 
same number of fiat paths at 1 and at 2 have the same initial frequencies. Then these x, 
y are connected by moves of type (iv) in Proposition [T] and crossing path swappings. 

Suppose that initial frequencies of x and y are different. We note that 611 + 622 + ^12 = 
n{T — 1). Let a G {0, 1, . . . , T — 2} be the remainder of bu when it is divided by T — 1 
and /3 be the remainder of 622 when it is divided by T — 1. Then a + [3 + bi2 = n'{T — 1) 
for some integer n' < n. 

Since < a, /3 < T — 2, a + /3 + 612 = (T — 1) for 612 = 1. Then we can see that a 
fiber contains a single contingency table when 612 = 1. 

Now consider the case that 612 = 2. It can be easily verified that if a = T— 2, (5 = T—2 
and the two transitions from 1 to 2 have to occur at t and T — t{l<t<T — 1). Then 
the number of transitions 1 — )■ 1 and 2 — )■ 2 in the single-step paths are both T — 2. Since 
X and y are in the same fiber, the number of fiat paths are the same. Hence the initial 
frequency of x and y is uniquely determined. Therefore x and y in the same fiber are 
connected by moves of type (iv) in Proposition [1] Ifa<T — 2or/3<T — 2, then the 
number of fiat paths at 1 in the fiber takes two consecutive integer values. Without loss 
of generality consider the case a;(ll...l) = y(ll. ..!) + !. Then we can apply a degree 
3 sliding move z to x such that cc'(ll . . . 1) = a:;(ll ... 1) — 1 where x' = x + z. Then x' 
and y are connected by moves of type (iv) in Proposition [TJ 

For the case 612 > 3, there are more than two single-step paths going from 1 to 2. In 
this case we pick two such paths and apply moves of type (iv) of Proposition [H so that 
the locations of transitions from 1 to 2 of these two paths are far apart from those of 
other paths. Then we can slide these two paths to alter the initial frequency. This proves 
the lemma. D 

After proving above two lemmas, it suffices to consider fibers satisfying 

bn > 0, 612 > 0, 621 > 0, 622 > 0. (14) 

We now employ distance reduction argument. Let x and y be two contingency tables of 
the same fiber and z := x — y. Define \z\ := ^^ Yli j l^ljl- ^^ ^ ^^^ V have the same 
initial frequencies, then they are connected by moves in Proposition [H We note that 

u u T 1 T I 

O21 -O12 = Xi -x^ = y^ -y^, 

which implies zl = z^ . In the same way we have z\ = z^ . Therefore, without loss of 
generality we assume 

z\ = a, z^ = a, z\ = —a, zj = —a, a > 0. 

Lemma 3. Assume ([7^ and T > 3. Suppose that a = zl > can not be decreased by 



type II degree one moves. Then x(ll ... 1) > and y{22 ... 2) > or by these moves we 
can transform x and y to x' and y' such that x'{\\ ... 1) > and y'{22 ... 2) > 0. 



Proof. Suppose that there exists a path w = {si, . . . , st) € Wu2*i in x such that St = 2 
for 1 < t < T. Then by a type II degree one move fITT]) with i = 1 and j = 2, we can 
reduce a. Therefore we can assume that all paths in Wu...^:i are (11 ■ • ■ 1). By symmetry, 
we can also assume that all paths of y in W2*...*2 are (22 ■ ■ -2). 

Suppose that x(ll ■ ■ ■ 1) = 0. By the above argument, x has no path in Wu...^i. Hence 
for any path in a;, si = 1 implies st = 2 and st = 1 implies si = 2. Then we note that 
zl = z^ = -a implies y{22 ■ ■ ■ 2) > 0. Therefore either x(ll ■ ■ ■ 1) > or y{22 ■ • ■ 2) > 
is satisfied. We now assume a;(ll ■ ■ ■ 1) > without loss of generality. 

Suppose that y{22---2) = 0. Then for any path in y, Si = 2 implies s^ = 1 and 
St = 2 implies Si = 1. Suppose that u = (si, . . . , sy) G W2*-*i is a path in y. Since 
zl = zj = —a < 0, there has to exist a path u' = {s[, . . . , s'j.) G iyi*...*2 in y. 

If oj and oj' meet at t, by applying a crossing path swapping z in 1^, y is transformed 
to y' = {y'{w), w G {1, 2}-^} G J-f, such that y'{oj) > and u = (si, . . . ,St, Sj+i, . . . , s^) G 
W^2*-*2- Then by the above argument we can assume that y'{22 ■ ■ -2) > 0. 

Next we consider the case where u does not meet any u' G IVi*...*2 in y. Then y has 
only one path in each of W2*...*i and Wu...^2- Let a) and u' be such paths. Suppose that 
u = (2121 ■ ■ -21) and u' = (1212 ■ ■ ■ 12). Then from the assumptions that bn > and 
622 >0 we have y'{22---2) > 0. 

Suppose that both u and u' are single-step paths. If 622 = ^ — 2, 



_ 1, a tu = to or to' 
0, otherwise 

and 612 = 621 = 1. Then x has to contain two single-step paths a) and u' which does 
not meet each other. {td,Lj') is transformed to {00, u') by a 2 by 2 swap iQ. Hence 1^1 is 
reduced. 

If 622 > T — 2, there has to exist another path u" = (s'/, . . . , s'^) in y such that 
s" = s'l_^i = 2 for some t. By a 2 by 2 swap, u and u' are transformed to single-step 
paths at t. Denote them by a; = (22 ■ ■ ■ 211 ■ ■ ■ 1) and u' = (11 ■ ■ ■ 122 ■ ■ ■ 2). Then apply 
crossing path swappings as follows, 

{u,u"} = {i22---2U---l),is';.„sl,.,T)} 
^{(22---2,<+,^^),«„ll---l),} 

{c2;',(22---2,<+i^y)} = {(22---22,4V2:T),(ll---122---2)} 
o{(22---2),(ll---12s;V2^^),} 

Therefore y is transformed to y' such that y'{22 • • ■ 2) > 0. 

Suppose that both u and to' are not single-step paths. Then we can easily see that u 
is transformed to another path u" = {s'(, . . . , s'^) by a type I degree one move. Then u' 
and u" meet somewhere. Therefore in the same way as the above argument by a crossing 
path swapping to y, y is transformed to y' such that y'{22 ■ ■ -2) > 0. D 
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Lemma 4. Assume [I4\ ) and T > 3. Suppose that a = zl > can not be decreased by 
the moves of TheoremUl except for degree 3 sliding moves. Then by these moves we can 
transform x and y to x' and y' which consist of flat paths and single-step paths only. 

Proof. By Lemma [3l we can assume that x(ll ■ ■ ■ 1) > 0. By the argument in the proof 
of Lemma [31 we can assume that paths in x which start and end at 1 are flat paths at 1 
(11 ■ ■ ■ 1). In the same way we can easily show that paths in x which start at 1 and end 
at 2 are assumed to be single-step paths (11 ■ ■ ■ 12 ■ ■ ■ 2). We can also assume that paths 
in X which ends at 1 are flat paths at 1 (11 ■ ■ ■ 1) or single-step paths (22 ■ ■ ■ 21 ■ ■ ■ 1). 

Next we consider a path a; in a; which starts and ends at 2. Suppose that u is not a flat 
path at 2. Then there exists 1 < tg < ti < T such that si-t^^i = (22 ■ ■ -2), St^ = s^^ = 1, 
^ti+i-.T = (22- ■ -2). We now suppose that there exists to < ^2 < ^i such that St^ = 2. 
Then by crossing path swapping of u and a flat path w' at 1, we can transform (w, a;') to 
u G W2*i*2 and u' G ^^1*2*1 as follows. 



^ 



Then we can reduce a by applying a type II degree one move to u'. Hence we can assume 
that Sto:t^ = (11 ■■■1). 

Since x(ll---l) > 0, we can apply a crossing path swapping of u and a flat path 
(11 ■ ■ ■ 1), we can transform them to two single-step paths as 



o ^^ . . . . , .... ^^ . 

Therefore we can transform cc to a;'. In the same way we can show that y is transformed 
to y'. D 

By Lemma m we assume that x and y consist of flat paths and single-step paths only. 
Now by applying degree 3 sliding moves and adjusting the number of flat paths at 1 as 
in Lemma [21 we can adjust the initial frequencies of x and y. This proves Theorem [H 

5 A numerical example 

In this section we give a numerical example for testing THMC model without initial 
parameters against THMC model with initial parameters using a Markov basis derived 
in the previous section. 

Table [D refers to the number of sequences of the sexes of the first four children in order 
of birth for selected Amish families which had more than or equal to four children (Klotz 
[Sj). In the selected families, parents were born before or up to 1910. The families with 
multiple birth are eliminated. Klotz Klotz [3] found some Markov dependence between 
sexes of consecutive children. 
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Figure 3: Sampling distribution of L via MCMC 

Here we consider the test of the uniformity of the initial parameters 71 = 72 . In order 
to verify the uniformity of the initial distribution, we test the goodness-of-fit of THMC 
model without initial parameters against THMC model Hi via Markov basis technique. 
We use likelihood ratio statistic L as a test statistics. For data in Table 1 we have 
L = 0.1219. 

We computed the exact distribution of L via MCMC with a Markov basis derived 
in Theorem 3.1. We sampled 10,000 tables after 5,000 burn-in steps. The histogram of 
the sampling distribution of L is shown in Figure |3l The solid line represents the den- 
sity function of the asymptotic x^ distribution with degrees of freedom 1. The asymptotic 
p- value and the exact p- value are 0.7270 and 0.6461, respectively and hence Hq is accepted. 



Table 1 : The sexes of the first four children in order of birth for selected Amish families 



MMMM 


8 


FMMM 


13 


MMMF 


14 


FMMF 


11 


MMFM 


13 


FMFM 


9 


MMFF 


19 


FMFF 


9 


MFMM 


11 


FFMM 


10 


MFMF 


9 


FFMF 


8 


MFFM 


11 


FFFM 


9 


MFFF 


13 


FFFF 


10 



M : male, F : female 
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6 Concluding remarks 

We derived a Markov basis for THMC model without initial parameters ([I]) for 5 = {1, 2} 
and arbitrary T > 3. The basis consists of moves of degree at most three and the types 
of moves are common for all T > A. For the model ([2]) we had similar "finiteness" result 
in Takemura and Hara [8] with a Markov basis consisting of moves of degree at most two. 
Each fiber of ([2]) is a subset of a fiber in ([1]). This corresponds to the fact that ([1]) 
is a submodel of ([2]), such that the sufficient statistic for ([1]) is a linear function of the 
sufficient statistic for (|2]). Type II degree one moves and the degree 3 sliding moves are 
needed to connect fibers of ([2]) in each fiber of ([T]). It is of interest to consider other 
nested toric statistical models and identify moves which are needed to cross fibers of a 
larger model within each fiber of a smaller model. 
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